Unsupervised classification of uncertain data objects in spatial databases using computational geometry and indexing techniques
Abstract
Unsupervised classification, also called clustering, is the process of organizing objects into groups whose members are similar in some way. Clustering uncertain data objects is a challenge in spatial databases. In this paper we use probability density functions (PDFs) to represent uncertain data objects and apply the Uncertain K-Means (UK-Means) algorithm to generate the clusters. This algorithm uses the Expected Distance (ED) to measure the distance between objects and cluster representatives. To further improve the performance of UK-Means, we propose a pruning technique based on Voronoi diagrams from computational geometry that reduces the number of ED computations. The technique prunes effectively but introduces pruning overhead of its own. To reduce this overhead, we build an R*-tree index over the uncertain data objects, which lowers both the computational cost and the pruning overhead. Our approach, which integrates UK-Means with Voronoi diagrams and an R*-tree over uncertain data objects, compares favorably with existing methods.
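The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes each uncertain object is given as a discretized PDF (an array of sample points plus probabilities), that ED is the probability-weighted Euclidean distance, and that cluster representatives are updated as the mean of the members' expected positions. The Voronoi-based pruning is approximated by a pairwise bisector test on the corners of each object's minimum bounding rectangle (MBR). The function names (`expected_distance`, `mbr_corners`, `prune_candidates`, `uk_means`) and the `init` parameter are hypothetical, and the R*-tree index is omitted.

```python
import itertools
import numpy as np

def expected_distance(samples, weights, c):
    """ED(o, c) = sum_k w_k * ||x_k - c|| for a discretized PDF (assumed form)."""
    return float(np.sum(weights * np.linalg.norm(samples - c, axis=1)))

def mbr_corners(samples):
    """All 2^d corners of the minimum bounding rectangle of the samples."""
    lo, hi = samples.min(axis=0), samples.max(axis=0)
    return np.array(list(itertools.product(*zip(lo, hi))))

def prune_candidates(corners, centers):
    """Bisector test: drop center j if some center i is strictly closer at
    every MBR corner. The MBR then lies on i's side of the bisector, so
    ED to j cannot be the minimum and need not be computed."""
    dist = np.linalg.norm(corners[:, None, :] - centers[None, :, :], axis=2)
    k = centers.shape[0]
    alive = []
    for j in range(k):
        dominated = any(np.all(dist[:, i] < dist[:, j])
                        for i in range(k) if i != j)
        if not dominated:
            alive.append(j)
    return alive

def uk_means(objects, k, iters=20, init=None, seed=0):
    """objects: list of (samples, weights) pairs. Returns (labels, centers)."""
    rng = np.random.default_rng(seed)
    # Expected position of each uncertain object.
    means = np.array([np.average(s, axis=0, weights=w) for s, w in objects])
    if init is None:
        init = rng.choice(len(objects), size=k, replace=False)
    centers = means[np.asarray(init)]  # fancy indexing returns a copy
    corners = [mbr_corners(s) for s, _ in objects]
    labels = np.zeros(len(objects), dtype=int)
    for _ in range(iters):
        # Assignment step: compute ED only for candidates that survive pruning.
        for n, (s, w) in enumerate(objects):
            cand = prune_candidates(corners[n], centers)
            eds = [expected_distance(s, w, centers[j]) for j in cand]
            labels[n] = cand[int(np.argmin(eds))]
        # Update step (assumption: mean of the members' expected positions).
        for j in range(k):
            members = means[labels == j]
            if len(members):
                centers[j] = members.mean(axis=0)
    return labels, centers
```

The bisector test is safe because the set of points closer to one center than to another is a half-space: if every MBR corner falls in it, so does every sample of the object. In this sketch the test compares all pairs of centers for every object, which is the kind of pruning overhead that the abstract proposes to reduce with an R*-tree.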
Similar resources
An Approach for clustering uncertain data objects: A Survey
Recently, uncertain data objects are used in various applications such as VANET environments, sensor applications, image processing based systems etc. Clustering of uncertain data is a major concept in data mining since more and more applications, such as sensor databases, location databases, and biometric information systems, produce vague and imprecise data. Clustering of uncertain data objects i...
Embedding advanced geometric techniques into SQL for efficient indexing of mobile objects
It is of great importance a trial to embed new geometric techniques into SQL in order to achieve more efficient indexing of objects moving on the plane and answer range queries about their future positions. This problem is motivated by real-life applications, such as allocating more bandwidth for areas where high concentration of mobile phones is imminent, or predicting future congestion areas ...
Extending the Qualitative Trajectory Calculus Based on the Concept of Accessibility of Moving Objects in the Paths
Qualitative spatial representation and reasoning are among the important capabilities in intelligent geospatial information system development. Although a large contribution to the study of moving objects has been attributed to the quantitative use and analysis of data, such calculations are ineffective when there is little inaccurate data on position and geometry or when explicitly explaining ...
Indexing Constraint Databases by Using a Dual Representation
Linear constraint databases are a powerful framework to model spatial and temporal data. The use of constraint databases should be supported by access data structures that make effective use of secondary storage and reduce query processing time. Such structures should be able to store both finite and infinite objects and perform both containment (ALL) and intersection (EXIST) queries. As standa...
Search Problems for Speech and Audio Sequences
The modern proliferation of very large audio and video databases has created a need for effective methods of indexing and searching highly variable or uncertain data. Classical search and indexing algorithms deal with clean input sequences. However, an index created from speech or music transcriptions is marked with errors and uncertainties stemming from the use of imperfect statistical models ...
Journal: CoRR
Volume: abs/1312.2378
Publication date: 2012